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The general problem of computing the false-alarm rate vs. detection-threshold relationship for 
a bank of correlators is addressed, in the context of maximum-likelihood detection of gravitational 
waves, with specific reference to chirps from coalescing binary systems. Accurate (lower-bound) 
approximants for the cumulative distribution of the whole-bank supremum are deduced from a 
class of Bonferroni-type inequalities. The asymptotic properties of the cumulative distribution are 
obtained, in the limit where the number of correlators goes to infinity. The validity of numerical 
simulations made on small-size banks is extended to banks of any size, via a gaussian-correlation 
inequality. The result is used to estimate the optimum template density, yielding the best tradeoff 
between computational cost and detection efficiency, in terms of undetected potentially observable 
sources at a prescribed false-alarm level, for the simplest case of Newtonian chirps. 

PACS numbers: 



I. INTRODUCTION. 

In the next few years a number of optical-interferometric detectors of gravitational waves will be in operation [lj- 
0, supplying steady fluxes of data to be sieved in the search of gravitational wave (henceforth GW) signatures of 
astrophysical origin. Among expected signals, those of known shape, except for the values of a set of parameters which 
embody the relevant information on the source physics, are specially valuable to perspective gravity wave astronomy 
The maximum- likelihood strategy for detecting signals of known shape in additive stationary gaussian colored 
noise consists in projecting the data on a finite discrete set of expected waveforms (the so called templates) , and 
using the largest projection A for computing a detection statistic. The detection statistic A is a random variable. 
Letting 

Fa(x) = Prob[A < a;|no signal], (1.1) 

it is possible to determine uniquely a threshold 7 such that: 

Prob[A > 7|no signal] = 1 - Fa (7) = a (1.2) 

where a is a prescribed false-alarm probability. The detection algorithm accordingly consists in comparing the detection 
statistic A to the threshold 7. Whenever A > 7 one declares that a signal has been observed at the false-alarm level 
a. The unknown source parameters are then estimated from those of the template yielding the largest projection. 
Deriving an exact (or suitably approximate) representation of the distribution function F\(x) is clearly a key problem 
for detector's efficiency optimization. In this paper we present a thorough analysis and a possible working solution 
of this problem, for the special relevant case of gravitational wave chirps from coalescing binary stars. In particular, 
we present the formal deduction of several partial results anticipated in 0,@- As a by-product, previous findings by 
Dhurandhar, Schutz, Krolak and Mohanty |9|-|l2| are recovered and hopefully generalized. The paper is organized as 
follows. In Sect. [HJ gravitational wave chirp templates and correlators are introduced, the minimal match is defined, 
and the statistical covariance among different correlators is computed, for the special relevant case where no signal 
is present in the data. This section is included to make the paper self-contained and to fix the notation, and could 
perhaps be skipped by the expert Reader. In Sect. II I II the minimal-match issue is reviewed, and the interplay between 
template density, detection threshold and detector's efficiency (fraction of detected events among those potentially 
observable) is discussed. The study of l|l.ll) is undertaken in Sect. IIVI Available results are first reviewed. A number 
of accurate lower-bound (conservative) approximants of F\(cc) are then discussed. The abstract asymptotic properties 
of F\(x) in the limit where the number of correlators N — > 00 are next highlighted. It is finally shown that one can 
capitalize on a classical inequality to extend the validity of numerical simulations made on moderate-size template 
banks to banks of arbitrary (large) size. In Sect. 0the minimal-match issue is reconsidered, in the light of results 
obtained in previous sections. The main conclusion is that, in contrast to common belief, increasing the template 
density in parameter space beyond a certain limiting value does not improve the detector's performance (fraction 
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of detected events among those potentially observable) in any significant fashion. It is pointed out that in order to 
estimate the above limiting value, the accuracy of the no-signal cumulative distribution is a key ingredient. 

Conclusions and recommendations follow under Sect. IVII 

The core results of this paper (especially those in Sect. llVfl rely on a number of rather technical arguments, which 
are collected in Appendix A and B. 



II. CHIRP TEMPLATES AND CORRELATORS 



The (spectral) projection of the data A(f) on a template T(f) is the (antenna noise-weighted) scalar product 

' r/. d f 



(A, T) = 2Re ■ 
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A(f)T(f) 



(2.1) 



also called the correlator between A and T. In 12. A(f) = S(f)+N(f), where S(f) is a (possibly null) signal, -/V(/) 
is a realization of the antenna noise, (fi, f s ) is the useful antenna spectral window, II(/) is the antenna noise power 
spectral density, and * denotes complex conjugation. 

The stationary-phase asymptotic representation of a post-newtonian chirp waveform can be written |13| : 



H(f, <$>, r, = ffor 7/6 exp { j [2nfr - + V(/, £)] } 



(2.2) 



where Hq is a (complex, unknown) constant, depending on the source position and (relative) antenna orientation, r is 
the nominal coalescency time |l4j . <f> is the phase at t = r and ip(f, £) is a function of the intrinsic source parameters 
only (i.e., the binary companion masses and spin parameters), represented by the vector £. Equation 12.211 is used to 
model the sought signal as well as to construct the templates. Suffixes S and T will be used, when needed, to identify 
the signal and template, respectively. Templates are usually constructed with unit norm, viz.: 
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|T|| = (T,T) 1/2 = 1. 



The maximum- likelihood detection statistic is (any monotonically increasing function of): 

A = sup sup sup (A, T) , 

|* T T T cf>T 

where the supremums are taken on the set of available templates. 

The correlators (I2.1|l can be readily maximised w.r.t. the irrelevant parameter <pT, yielding: 
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A( f\f-7/6 p -3[^frT+i>(f,iT)]_4L- 
[J)J n(/) 



fs 



f 



-7/3. 



fi 



df 

n(/) 



1/2 



(2.5) 



The r.h.s. of H2.5JI is called the noncoherent correlator. In view of eq. Ij2.5|l all templates can be generated with 
4>t = 0. The integral in (|2.1(l is then recognized (but for inessential factors) as the / — » tt Fourier transform of the 
function: 



K A (f,Z T ) 



,A (/)r*(/, o,o,60 
n(/) 



/ e ifij s ) 

elsewhere. 



(2.6) 



One is therefore led to construct a family of reduced templates with (f>T = tt = 0, parameterized in £x. Further 
maximisation of the noncoherent correlator (|2.5(l w.r.t. tt can thus be accomplished by taking the largest absolute 
value of the / ^ tt Fourier transform of 1)2. 6fl . The resulting quantity 



C[A,T] = su P c[A,T] - su P sup(A,T) = sup F f ^ T [K A (f,( T )] 



(2.7) 
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is called the reduced noncoherent correlator, and is a function of the intrinsic source and template parameters only. 
The detection statistic can thus be written 

A = sup C[A, T]. (2.8) 



A. The Minimal-Match Prescription 



The similarity (matching) between a signal S(f) and a (non reduced) template T(f) is measured by their overlap, 

(S,T) 



0[S, T] 



\\S\\-\\T\\ 



< 1 



where the norm is defined in eq. i|2.3[) . 

Given a (discrete) family of templates C, and a set S of admissible signals, the quantity 

FF(C,S) = inf sup 0[S,T] 
Ses Tec 



(2.9) 



(2.10) 



is called the fitting- factor between C and S [l5| . 

The probability of detecting (at any fixed false-alarm level) a known signal S{f) hidden in the noisy (spectral) data 
A(f) using (A, T) as a detection statistic is an increasing function of the overlap 0[S,T]. Thus, given S, the 
(discrete, non-reduced) template family C is designed in such a way that 



FF(C,S) > r, 

where the physical meaning of T will be discussed in Sect. IIIII It is convenient to define the function 



M[S,T] = sup sup 0[S,T] 

TT 4>T 

called the match between S and T 0], and to rephrase (|2.11|) as follows: 

VSeS, 3TeC: M[S,T] > T. 

The quantity T is referred to as the minimal match. 

The match can be readily computed in terms of the reduced templates and waveforms: 



M[S,T] = 
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where, from eq. (|2.2(l 
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More or less obviously, 



V£ g ,£ T , 0<M[S,T}<1, 



(2.11) 
(2.12) 

(2.13) 



(2.14) 



(2.15) 



(2.16) 



the equality holding iff S = T. 



B. The (Reduced) Template Lattice 

We shall assume the reduced waveforms/templates to be parameterized in such a way that tp(f,£s) — ^(f^r) is a 
function of the difference £s ~ £t o nly [l7j |. Under this assumption, to leading order in the (small) difference £s — £r, 
the match can be cast in the form [lfil ]: 

M[S,T]«1-(£ s -£t) 2 (2.17) 
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and the mimimal-match condition becomes: 

y( s e s\ 3( T g a ■. ds ~ irf < i - r, (2.i8) 

where S' C S and £' C £ are the sub-manifolds corresponding to = r = 0. The ball 

S fc = ■■ ds - 6c) 2 < 1-r} (2.19) 
is called the span of = T(6c) at the minimal-match level T, and the minimal-match prescription is further rephrased 



as: 



(2.20) 



In the simplest case, the ik can be chosen to form a cubical lattice [T^. Then, the distance between nearest-neighbour 
reduced templates (cubical-cell sidelength) is 



A = 2 



i-r 
p 



1/2 



(2.21) 



where P is the dimension of £. Hence, the template density is A p , and the template number is of the order of 
the volume of S' divided by the elementary cell volume A p . 



C. The Pure-Noise Case Correlator Covariance Matrix 



In deducing the (no-signal) cumulative distribution of the whole-bank supremum it is expedient to deal with the 
following squared quantities: 



2 2 2 



(2.22) 



in view of the exponential (no-signal) distribution of the random variable Z. The sought transformation does not affect 
detector performances, X being an equivalent sufficient detection statistic. Equations (|2.22() will be used systematically 
throughout. Letting 
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where 



JV(/)/ -r/6 e - J - W ,Q 
0, elsewhere, 



one readily gets, in view of eq.s (|2.5|) and (|2.22|) : 



Z = \Y(t,0\ 2 , Z = sup Y(t,0 



X = sup sup 
« r 



(2.23) 



(2.24) 



(2.25) 
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The noise being assumed as stationary zero-mean colored gaussian, Y(r, £) is a zero-average complex gaussian random 
field. It is therefore completely characterized by its second-order joint expectation value (covariance), viz.: 



f, 



f-7/3 -j{27r/(T-0+[^(/,i3-^(/,f)]} d f 
* f-7/3 d f 

u n (/) 



(2.26) 



It is readily seen that: 



E[Y(r,OY*(T',e) 



c[T,T'} 
\T\\ • I IT' 



(2.27) 



D. Discrete Implementation 

Let Na the total number of reduced templates used to cover S' . Each reduced template yields as many (non- 
coherent) correlators as the number of time bins in the discrete Fourier transform (henceforth DFT used to 
compute numerically eq. I|2.5|l . The above number will be denoted by Nq. The detection statistic is thus: 



A = max max 

k£[l,N A ] he[l,N e ] 



DFT h [K A (f,£ k )] 



(2.28) 



where the integers h and k span the DFT output time bins, denoted as DFTh[-], and the (discrete) reduced template 
family, respectively. 

The match between a signal S and a template T is computed as 



M[S,T]= max 

he[l,N ] 



DFT h [K s (f,tk)] 



' f-7/3 $ 

n n (/) 



(2.29) 



As an effect of time-discretization, M[S,S] computed with 12.29f) can be less than one. The DFT time step (and 
length) should be accordingly chosen in such a way that 



VSeS', 1 -M[S,S] < 1 -r. 

We define the following zero-average gaussian random suite, which is the discrete counterpart of eq. (|2.23|l 



Y, 



h,k 



fs f-7/3 ^ 

u n (/) 



-1/2 



DFT h [K N (f,£ k )], h = l,2,...,N e , k = 1,2, . . . ,N A , 



which is completely characterized by the covariance matrix 

a(k - h, C» - L) = E [Y h , m Y* in ] . 
We note that cr(0, 0) = 1 by construction, and further define 

p M = max \a(q, £ n - $ m )|, (q, £ n - f m ) ^ (0, 0). 
Note that, in view of eq. H2.30|l one has 

pm = k(i,o)| 



(2.30) 



(2.31) 



(2.32) 



(2.33) 



(2.34) 



which means that the largest (absolute) covariance always occurs between correlators differing only in t by a single 
DFT time bin. 
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III. MINIMAL MATCH VS. UNDETECTED OBSERVABLE SOURCES 



The minimal match is more or less obviously related to the fraction of potentially observable sources which would 
be undetected as an effect of template mismatch. It was first argued in 15] that (under the simplest assumption of 
isotropically and homogeneously distributed sources) this latter would be simply (1 — T 3 ). 

In order to gain insight into this issue, it is expedient to work in terms of the detection probability Pd- Accordingly, 
let 



P.i 



1-/3 = 1 -Q[ 7 -£(A)] 



(3.1) 



where Q(-) is an unknown CDF, whose density can be nonetheless safely assumed as being unimodal and center- 
symmetric, at least for meaningful values of the signal-to-noise ratio. In the worst case of minimal match, 



£(A) re Vol, 

where d is the signal deflection (signal-to-noise ratio), viz.: 



(3.2) 



fs S(f)S*(f) 
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(3.3) 



and we assume rf> 1. Let the observable sources be those for which Pd > 0.5. Under the above assumptions for Q(-), 
these correspond to: 



d>l. 



(3.4) 



The deflection d on the other hand can be written 



A K 

d= R 



(3.5) 



where R is the source distance, and K is a constant depending on the antenna and source features, as well as their 
mutual orientation. Equation l|3.4|l allows to define the radius R of the antenna-centered sphere which contains all 
observable sources of a given class (seen by the same antenna, sharing the same features and orientation), viz.: 



R = 



AT 

1 



(3.6) 



Hence the number of observable sources in the given class, under the most obvious hypothesis of uniform (homogeneous 
and isotropic) spatial source density p s is: 



N s = g7rp s 



AT 



(3.7) 



Letting T 



N, 



(max) 



max, N max the largest achievable minimal match, and the related number of templates, and denoting as 



the corresponding value of N s , one has from H3.7(l : 



(max) 



N. 



(3.8) 



where the dependence of 7 from a and N (and hence on T itself) has been written explicitly. The quantity 1 — r) 
represents the fraction of observable sources which would be lost due to poor minimal match (r < T max ). Note that 
the ratio 77 does not depend on K. 

Neglecting the dependence of 7 on N, the second factor on the r.h.s. of l|3.8|l cancels, and we are left with 



(3.9) 



which does not depend on a. Under the same assumption, N s in l|3.7|) increases monotically with T, attaining its 
supremum ]\f( max ) a t T max = 1, yielding 7/0 = T 3 . This is, in essence, the simplest argument introduced in |15). 
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This argument conceals an obvious naivety insofar as it neglects the dependence of the detection threshold 7 
corresponding to a given false-alarm probability a on the total number of templates N — NqN&, and hence on T 
itself. 

In order to refine this reasoning one should fully take into account the dependence of 7 on N, which in turn is 
affected by the covariance among the correlators. We shall postpone the relevant results of such a refined analysis 
until the end of Sect. [V] after having introduced the needed no-signal CDF of the detection statistic A, which is the 
subject of the next section. 



IV. NO-SIGNAL WHOLE-BANK STATISTICS 



In order to provide a "working" solution to the problem of computing the whole-bank false- alarm probability, we 
should be able in principle: 

• to provide a hopefully decent approximant of Fx ; 

• to guarantee that the above approximant is otherwise a lower bound for the true CDF, so that one makes a 
conservative error while using it to compute the threshold corresponding to a given false-alarm probability; 

• to formulate a criterion whereby the validity of simulations made on relatively small-sized banks could be 
extended to huge (real-world) banks, for which direct numerical simulations are either unwieldy or unfeasible. 

To make the dependence of Fx(x) on the number N of templates explicit, we shall denote it in this section by 

F^\x) = Probpf < a:|no signal]. (4.1) 



A. Background 

In the ideal case where the correlators are statistically independent, one has simply: 

F x N \x) = [1 - expt-z)]" =: $£\x). (4.2) 

In all practical cases, the covariance among the correlators is nonzero, and hence the independent-correlator formula 
(|4.2|) is poorly accurate. However, in Appendix A we prove that, for any finite N and any x, the independent-correlator 
approximation <&x is a lower bound for Fj^', viz.: 

Fj*>*™ (4.3) 

thus implying a conservative error (for any fixed threshold it gives a lower false alarm rate). 

It was first suggested by Dhurandhar and Schutz 9] that eq. I|4.2[l can be made more accurate after reducing N by 
a suitable factor e £ [0, 1[. Extensive numerical simulations supporting this suggestion were subsequently reported in 
10] by Dhurandhar and Mohanty J2jj. In Jaranowski et al. 0| it was heuristically argued that 

(4.4) 

where h(t]t), ^(Vc) ar e the measures of the span and correlation neighbourhood |2l| of a template, respectively |ll| . 
which are both functions of the prescribed minimal match T. In 12] Mohanty introduced an elegant formal argument 
whereafter an approximate closed-form expression for e could be derived |23. Mohanty 's argument is based on the 
following key assumptions, valid for suitably large values of the threshold: 

i) that at most two (squared) correlators, say Z, Z' can simultaneously exceed the threshold x, and 

ii) that the corresponding templates can only be nearest-neighbours, featuring the largest (absolute) covariance. As 
already noted, the largest (absolute) covariance pu occurs between Yij and so that Mohanty's argument 
applies naturally to the cumulative distribution of each and any reduced noncoherent correlator Z. 
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Hence 0, 

F { z e) (x) Kil-Ne Prob[Z itj > x\no signal] + {N e - 1) Prob[Z 4J > x, Z t+1 .j > x\no signal]. (4.5) 

This can be used to obtain the mentioned e factor by comparison with 

F { z Ne) (x) « [1- exp(-x)] eNe sa 1 - e7V exp(-x), (4.6) 

(binomial approximation, large thresholds) yielding: 

e»1-(1- ATe^Prob^i+ij > x\Z. hj > x, no signal]. (4.7) 

The above quoted results do not allow to make any rigorous statement insofar as the behaviour of F% is concerned 
for large N and/or large thresholds (beyond values for which numerical checking is feasible). In addition it is not 
clear whether using (|4.5|l gives a conservative estimate of the threshold. 



B. Mohanty's formula as a lower bound 



In this section we derive Mohanty's formula following a different route, whose validity does not rest on any of the 
ad hoc hypotheses made in ^^]- We shall further prove that Mohanty's formula is in fact a CDF lower bound. 
We start from the following Bonferroni-type inequality [24[ : 



Prob 



= Prob 



N-l 



|J (Ei-EiHEi+^UEit 



< 



N-l N N-l 

< Prob [Ei -Eif] E l+l ] + ¥xoh[E N ] = ^ Prob^] - ^ Prob^ n E i+1 ], (4.8) 

i—1 i—1 i—1 

which holds for arbitrary events {Ei}^L 1 . Equation l|4.8|l can be used to derive several lower bounds for the cumulative 
distribution of the reduced noncoherent correlator Z. Indeed, letting Ei = {Zij > x}, i — 1,2, ...,N@, where 
j € (1, 2, ... , JVa) is fixed, one gets: 

N B 

(x) = 1 — Prob[ max Z^j > x\no signal] > 1 — ^^Prob[ Zij > x\no signal ] + 



ATe-l 

+ Prob[ Zij > x, Z i+ ij > x\no signal ]. (4.9) 

i=l 

Equation (|4.9|l yields different lower bounds, depending on the way the templates are indexed. The tightest bound 
is obtained whenever the indexing is such that consecutive templates in (|4.9|l exhibit the largest covariance. In view 
of eq. i|2.3(J[l this occurs invariably between templates corresponding to the same intrinsic parameters, and differing 
only by a single time-discretization bin in r p5| . 

Taking into account the stationarity of the sequence Z^j, eq. (|4.9I) can be cast in the form: 

F^ Ne) (x) > 1 - AT e Prob[ Z hj > x\no signal ] + (N e - l)Prob[ Z hJ > x, Z l+1J > x\no signal ]. (4.10) 
Both probability terms in (|4.1U|) can be evaluated explicitly, as shown in detail e.g. in |2jj, yielding: 

Prob[Zjj > x\no signal] = cxp(— x), (4-11) 

and 

Prob[Zy > x, Z l+1 , > x\no signal] = (1 - p\ { ) £ 1 1 J_ , (4 .12) 

fe=o 1 E\ K -r 1 > 1 ) 
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where •) is the incomplete Euler gamma function [^J. Equation (|4 . 10|1 can be conveniently written in a more 

compact form as: 

F { z Ne \x) > 1 - N e exp(-x)e(Ne,x,p M ), (4.13) 

where: 

e(N e ,x, PM ) = 1 - (1 - N?) exp(^)(l - p 2 M ) £ gM^+lL^Mj gl (414) 

fc=0 *M* + -MJ 

Comparison of (|4.14l) with Mohanty's formula (|4.7|l reveals that this latter gives a 

In principle, inequality (|4.8|) can be used to derive lower bounds for the cumulative distribution of the whole-bank 

supremum as well. For the sake of simplicity, we refer to the case of Newtonian templates, featuring a single 

intrinsic parameter, namely the chirp mass. Under this assumption, the N = N@N& correlators can be conveniently 
cast in lexicographic order: 

Z id -> Z i+u _ 1)Ne , i = l,2,...,N @ , j = 1,2,...,7V A . (4.15) 

From (|4.15() and Fig. 1, it is seen that there exist N A — 1 consecutive correlator pairs whose covariances are negligible. 
It turns out that the relevant number of correlator pairs exhibiting the maximum (absolute) covariance pu is NqN a — 
(TVa — 1) — 1. Accordingly, letting Ei = {Zi > x} in equation l|4.8|l gives: 

N e N & 

Fj^\x) = 1 — Prob[maxZi > x|no signal] > 1 — Prob[Zj > x\no signal] + 

i — ^ 

i=l 

+ y~] Prob[Zi > x,Z l+ i > x\no signal], (4.16) 

i=i 

where the meaning of Zi is obvious from (|4.15|) . Equation (|4.16|) is conveniently rephrased as: 

F ( x N \x) > l-N e N A exp(-x)e(N e ,x,p M ). (4.17) 

C. Nested Mohanty's Formula 

Inequality 14. 8|) can be notably invoked recursively to provide hopefully more accurate lower bounds for the CDF of 
the supremum of the whole bank of (squared noncoherent) correlators. Indeed, letting Ej = {Zj > x}, j = 1, 2, . . . , 
in 14.8fl . we obtain: 

F^\x) = 1 — Prob[maxZj > x|no signal] > 

3 

JV A JVa-1 

> 1 — Prob[-Zj > x|no signal] + Prob[Zj > x,Zj + i > x\no signal]. (4-18) 

j=i j=i 

Again, by statistical uniformity and stationarity, 

F^\x) >l-N A Prob[Zj > a; | no signa 1] • |l-(l-A r A 1 ) Prob[Z j+1 >x\Z j >x,no signal]} , (4.19) 
for any j. One can use eq. (14. 13|> in 14.19[) . to obtain 

,x,pm) ■ {l — (1 — N A X ) Prob[Z 3+ i > x\Zj >x,no signal]} . (4.20) 
There is no simple recipe for computing the conditional probability in (|4.20l) . However one might speculate that 

Prob[Zj + i > x\Zj > x.no signal] > Piob[Zij+i > x\Z m j > x,no signal], (4-21) 
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for any is (1, 2, ... , Nq) and any j 6 (1,2,..., TVa — 1), where me (1,2,..., N@) is defined by the condition [2^| 

p' M = \E[Y^ +1 Y^ 3 ]\ > \E[Y id+1 Y^}\ , Vh + m. (4.22) 
Under this assumption, equation l|4.20|l yields: 

F { x N \x) >l-N A N @ exp(-x)e(Ne,x,p M )e(N A ,x,p' M ). (4.23) 

Equation (|4.23|) can be readily generalized to post-newtonian intrinsic parameter spaces of dimension higher than 
one, by iterating the above reasoning. It is also recognized that eq. (|4.17|l corresponds to neglecting the covariance 
between reduced templates in (|4.23[1 . 



D. Asymptotics 

We shall now discuss the asymptotic form of F^' as N — ► oo. This question is properly dealt with in the frame 
of extreme- value theory (henceforth EVT). Note that the study of the above asymptotics via numerical simulation 
is almost impossible (the tail behaviour requires a huge number of simulations) unless some a-priory information is 
available. 

The EVT philosophy can be understood intuitively by noting that the cumulative distribution F^ of the supremum 
of a set of N (equidistributed, unimodal) random variables displays the following universal behaviour as N — > oo: i) 
becomes steeper and steeper, tending to a step-like function, and ii) the CDF inflection point moves forward in x 
indefinitely. As a result, in the limit N — > oo, the CDF of the supremum is zero everywhere, except at infinity, where 
its limiting value is one. EVT is a general framework for dealing with the above limiting process in a meaningful 
way and relies on a suitable TV-dependent change of variable such that the location of the inflection point of 
F^ N ^ becomes independent of TV. It turns out that for most independent random variables, it is possible to find two 
sequences {a^} and {b^ > 0} such that 

lim F (N \a N + b N x) (4.24) 

exists, and can only be one among three universal functions: the Frechet, Weibull and Gumbel distributions |3(ij |. 

Using EVT we can readily prove that as V — > 00 the cumulative distribution (|4.2|l , converges uniformly to the 
Gumbel distribution (4 Hi- 

In fact, letting x = y + log V in 1)4.2(1 one gets: 

/ -v\ N 

$W( y + logiV)=(l-^ r J ^e--. (4.25) 

Switching back to the original variables, one gets: 

(x) ~ exp[- N exp(-x)] = G w (x). (4.26) 

It is also readily proved that the Gumbel distribution G^ N \x) is an upper bound for ^^\x), for any V and x, as 
illustrated in Fig. 2. From the inset in Fig. 2, one can appreciate that sup^, [G^ (x) — 9^\x)] drops below 10~ 5 
already at V - 10 4 . 

For the general realistic case where the covariance among the correlators is nonzero the existence of the a n and b n 
suites in 14.24|l is not guaranteed, nor the coincidence with their zero-covariance counterparts is implied. We prove 

in Appendix A that for any nonzero (fixed) template spacing, i.e., for any nonzero covariance, F% (x) converges 
uniformly to the Gumbel distribution in the limit as N — > 00, viz.: 

FW(y + logN) N -^?e-°- y . (4.27) 

The rate of convergence is more or less obviously slower the smaller the template spacing, i.e., the larger the maximum 
(absolute) covariance. This is illustrated in Fig. 3. 
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E. Beyond Mohanty-type Formulas 



Inspection reveals that the r.h.s.'s of Mohanty-type formulas (|4.13() . i|4.17[) . H4.23[) are bad candidates as CDF 
approximants. As x — > 0+ they become negative. Even worse, they do not exhibit the correct (Gumbel) asymptotic 
behaviour as N — > oo. Both these flaws can be remediated by introducing the exponential representations 



f ^ e \x) = e X p[-Nee{Ne,x,p M )eM-x)] 

y { x\x) = exp [-N & N A e(N @ , x, p M ) exp(-x)] 
I &x\x) = cx.p[-N e N A e(N e ,x 1 p M )e(N A ,x,p / M )exp(-x)] 



(4.28) 



which are always non negative, have the correct (Gumbel) asymptotic behaviour, and reproduce Mohanty's formulas 
(|4.13|1 . 14.17(1 . H4.23JI for x suitably large. We shall prove soon that l(4.28|l are in turn lower bounds for the corresponding- 
exact CDF's. We note from KM that 



Qx\ x ) 



N/M 



VM < N, N > 1, 



(4.29) 



where Qx can be any of the distributions in (|4.28l) . 



F. Extrapolating Simulations: A Gaussian Correlation Inequality 

The inequality 

F x N) (x) > [F ( x M \x)} N / M , M < N (4.30) 

where F ( x M \x) is the CDF of the supremum of any subset of the whole bank containing M < N correlators follows 
directly from the so called Gaussian Correlation Inequality (henceforth GCI) More or less obviously, the upper 
bound in l|4.30|l is attained as M — > N. 

Equation (|4.3l)|l is a key to demonstrate that equations in l|4.28|l are lower bounds for the pertinent true cumulative 
distributions. 

As an example, let us consider the case of V F Numerical simulations show that for relatively small banks, 

F X M \ X )>¥ X M \ X ). (4.31) 

On the other hand, in view of eq. (|4.29|) 

= *yp(x), N»l, (4.32) 
and hence, from 14.30JI . 

F$P(x) > §^°(x), VJV » 1. (4.33) 

The GCI can be used to extend the validity of approximate forms of Fx tested on relatively small banks to full-size 
banks, for which numerical simulations would be impossible or unwieldy. 



G. Accuracy and Thresholds 

The numerical accuracy of the (lower-bound) approximants (|4.28() is illustrated in Fig.s 4 and 5, obtained from 
numerical simulations over ~ 2 • 10 4 realizations. 

Figure 4 refers to the CDF F ( z Ne) of a single reduced correlator, and shows that the approximation provided by 

*^ e) is excellent. Fi gure 5 refers to the CDF of the supremum of a whole bank. For illustrative purposes, we stick 
at the simplest case of the one-parameter family of (reduced) newtonian waveforms in the range of chirp masses 
(0.2M© < M < 10M Q ). We assume the (initial) LIGO noise PSD pl |. 




(4.34) 
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with fo = 300Hz, = 40Hz and f s — AOOHz and IIo is a constant of no concern to us here. We sample the 
data at 3.2KHz (4-times the Nyquist rate), and let Nq — 2 21 , which corresponds to the longest- lived source allowed 

{M. = 0.2M Q ). Fig. 5 shows that the approximation provided by i s fairly good, and notably better than that 

obtainable from x ■ As N (resp. Nq) goes to infinity all approximants (|4.28|) merge by construction into the 
Gumbcl distribution. This latter is indistinguishable from the independent-correlator approximation in both Fig.s 
4 and 5. However, as seen from Fig.s 4 and 5, in most situations of practical interest for (possibly hierarchical) 
GW chirp search, eq.s (|4.28|) would provide significantly better approximants to the relevant CDF's than the plain 
independent-correlator (or Gumbel) approximation. Being lower bounds, eq.s l|4.28|l are also conservative in terms of 
false-alarm probability, when used to set detection thresholds. 

A sharper insight into the quality of the various proposed approximants is gained by comparing these latter in terms 
of the detection threshold corresponding to a given false-alarm probability a, as a function of This is illustrated 
in Fig. 6, for the special case a = 10 -3 . The thresholds in Fig. 6 correspond to assuming zero covariance among 
the correlators (dotted line, $^ V ' ) ), and to including the covariance among nearest-neighbour correlators along one 
(coalescency time, dashed line, ^^x^) or both (coalescency time and chirp mass, solid line, ¥p) coordinates of the 
newtonian parameter space, respectively. 

In Fig. 6, we also include the limiting case where T w pu ~ 1. The corresponding (asymptotic) value of 7 (dotted- 
dashed line in Fig. 6) is obtained using the appropriate asymptotic form of Fx(x), first quoted by Krolak jljj after 
and [H, viz.: 

F x {x) ~ 1 - K~ l fi[S'] (det[H]) 1/2 xexp(-a;), (4.35) 

where fi[-] is the Lebesgue measure, iS' is the allowed (reduced) source-parameter space, and H is the covariance matrix 
of the derivative random-field ISA. 



V. THE MINIMAL-MATCH ISSUE REVISITED 



We are now in a position to re-address the question concerning the relationship between the minimal match T 
(and/or, the number of templates in the parameter-space region of interest) and the undetected fraction of potentially 
observable sources. For simplicity, we shall again stick at the simplest case of Newtonian waveforms and templates, 
using the same range of source chirp masses and antenna noise PSD as in Sect. IIV Gl 

It should be noted first that a visible knee point in the T vs. Na curve invariably exists at some value Na = -/V*, 
beyond which any further increase in N<\ yields only a modest increase in T. 

This is illustrated in Fig. 7, where a knee point separating intervals of different (almost constant) slope in the T 
versus Na curve occurs at Na ~ 5 • 10 3 . Except for the numerical value of N*, the existence of the above knee point 
is a generic feature of the T vs. Na curve, occurring also for more realistic, higher-order post-newtonian waveforms 
and templates. 

Next we focus on eq. I|3.8[) . an d use our best approximant $^'(1) in eq. (|4.28|l for the no-signal CDF of the 
detection statistic in order to compute the ratio j(a, N rnax )/j(a, N) [3!j. In Fig. 8 we accordingly compare r) (eq. 
(|3.9|) . dashed line), to 77 (eq. I|3.8|) . solid lines), for a = 10~ k , k = 2,3,4 (top-to bottom), as functions of Na- In Fig. 
8 we assumed T max = 0.99 as a bona fide practical limiting value, and computed N max accordingly. From Fig. 8 it is 
seen that a knee point in the curves exists at Na ~ « 5 • 10 3 . It is also seen that r\ exceeds by a non negligible 5% 
typical the plain estimate 770 discussed in Sect. IHII for Na ~ N*. 

It is also interesting to compare in terms of 77 the alternative approximants for the no-signal CDF of the detection 
statistic introduced and discussed in Sect. IIVI The resulting curves are shown in Fig. 9, where again we assumed 
Fmax = 0.99 as a bona fide practical limiting value, and computed N max accordingly. The curves in Fig. 9 correspond 
to assuming zero covariance among the correlators (dotted line, ) > ano - ^° including the covariance among nearest- 
neighbour correlators along one (coalescency time, dashed line, X P^ V ' ) ) or both (coalescency time and chirp mass, solid 

line, ^^x ^) coordinates of the newtonian parameter space, respectively, in computing j(a, N max ) in l|3.8|l . For each 
case, the ij vs. Na curves corresponding to a — 1CP 2 , 10 -3 , 1CP 4 are displayed. It is seen that the dependence of n 
on a becomes the less relevant, the better approximant is used. 



VI. CONCLUSIONS 



We presented a detailed analysis of the properties of the cumulative distribution of the whole-bank supremum. A 
number of tight lower-bound approximations, of increasing accuracy, have been introduced and compared, valid for 
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any (finite) number of templates. In addition, we studied the asymptotic properties as the number of correlators goes 
to infinity, in the frame of extremal value theory (EVT). We also shew how the gaussian correlation inequality can 
be used to extend the validity of simulations made on small-sized (toy) banks. 

Concerning the minimal-match issue, we found a consistent evidence that increasing the number of templates beyond 
a critical value does not produce a sensible increase in the detectable fraction of potentially observable sources, at 
the expense of a marked growth of computational load. On the other hand, using the most accurate representation 
of the no-signal cumulative distribution of the (whole-bank) detection statistic to compute the threshold results in a 
sizeable increase in the detectable fraction of potentially observable sources over the naive oc T 3 estimate. 
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Appendix A - Proof of Equation 114.311 

In this appendix we prove that the CDF of the whole-bank supremum for the independent-correlator case is a lower 
bound for the true one. The proof relies on two preliminary Lemmas. 



Lemma A.l (Pitt's theorem |4(|)- Let [i a centered gaussian measure 41] in C, and let A, B two centered convex 
symmetric sets in C. Then 

n(Ar\B)>fi(A)n(B). (6.2) 



Remark - Pitt's theorem can be rephrased in functional (integral) form as 

E(fg) > E(f )E(g), (6.3) 

where /, g are log-concave symmetric functions and E{-) is e.g. the mean value w.r.t. to a centered complex gaussian 
measure ji. 



N 



The following Lemma is a more or less obvious extension of the above to C 



Lemma A. 2 Let /i^r a centered gaussian measure in C*. For any convex symmetric set A € C*, and any T> = 
{x € C N : \f{x)\ < r}, where / is any linear functional in and r <E R, one has 

f i N {AHV) > fM N (A)fx N {V). (6.4) 



Proof - It suffices to prove the Lemma assuming that /i n is a standard gaussian measure 42j. 

Under a suitable unitary transformation (preserving obviously the gaussian character of the measure /xjv) the set T> 
can be transformed into: 

V= {ye C N : \ yi \ <r}. (6.5) 

Now let 



ip(z)= / lA(z,v)t*N-i(dv), (6.6) 

where I^{z,v) denotes the indicator of A, z G C and v £ C*" 1 . I^{z,v) is a non negative log-concave function, 
and [i AT— i is a non negative log-concave measure, whose related PDF is also non negative and log-concave. Thus, 
according to theorem 4.6 in Bogachev |4l| . ^(z) is non negative and log-concave. The symmetry properties of A and 
/ijV-i imply that ip(z) is an even function. The level-sets tp(z) > r are thus convex and symmetric. 
Hence, letting U — {z £ C, \z\ < r}, by eq. (|6.3(l : 

ip(z)Iu(z)fjLi(dz) > / tp(z)fi 1 (dz) / I u (z)(ii(dz) = ^ N {A)^ N {V). (6.7) 
c Jc Jc 

On the other hand one has trivially: 

ip(z)Iu(z)fii(dz) = / lA(z,v)Iu(z)nN(dz x dv) = 
c Jc N 



lAnv(z,v)fi N (dz x dv) = fi N (A(l T>). (6.8) 
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Statement l|6.4|l follows from (|6.7|l and (|6.8|l • 

We are now in a position to provide the 

Proof of Eq. (jO|) - Equation gSJ in Sect. E| can be rephrased as 

N 

Probdnl <y u ...,\Y N \ <y N ) >[] Prob (l^l <Vi), (6-9) 

i=l 

where Yi, i = 1, 2, ...N are centered jointly gaussian random variables in C*. Let /z/y a centered gaussian measure in 
C N , and define the (convex, symmetric) sets: 

Ai = {aeC N : \a t \ < y t } . (6.10) 

Inequality (|6.9() can be rewritten: 

/ n \ N 



which follows by induction from Lemma A. 2 



Appendix B - Asymptotic Properties of Bank Supremums CDF 

In this appendix we prove that the CDF of the supremum of the bank correlator subsets corresponding to a fixed 
chirp mass, as well as the CDF of the supremum of the whole bank belong asymptotically to the Gumbel universal 
class, as the pertinent number of correlators goes to infinity (theorem 1 and 2, respectively). 

Throughout this Appendix we shall restrict to the simplest case of newtonian (OPN) waveforms and templates, for 
which 



^t(/,Ct) 



3 

128 



75" 



-5/3 



M T 5/3 r 5/3 



(6.12) 



where A^r is the template chirp mass |43j and c, G have obvious meanings. 

It is convenient to introduce the following dimensionless variables and parameters: 



t * - f KA M X 3 



nGfiMz 



-5/3 



where M & is the solar mass. 

The complex random field l|2.23f) and the pertinent absolute covariance (|2.27|l in turn become: 



(6.13) 



Y(tt,Zt)=Y(t,M) 



(6.14) 



and 



\E[Y(t,M)Y*(?',M')}\ = 



f-7/3 



ex P i J 



271-/6 + XAf 



-5/3 



/ n 



df 



(/) 



-7/3 



df 
n(/) 



p(6,A) 



3.15) 
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where 6 = f ' - f a nd A = M'~ 5/3 - M~ 5/3 . 

Specializing eq. I|2.31[) to the newtonian case, according to (|6.14|) and l|6.15[) . we introduce the 2D stationary 
sequence 

Y Kk = Y(hS e ,k8 A ), h, k e N, (6.16) 

and further define: 

Pm+i = p(mSe,0), (6.17) 
p m +i — max p(hSe,mSA), (6.18) 

h=l,...,N e 

Sq and 5a being the discretization steps in f and M., respectively. 
We start by proving the following Lemma: 

Lemma B.l - If the equation 

n(/) + 2/n'(/) = o, (6.19) 

has at least one solution / € [1, f s ], where II(/) is the antenna noise power spectral density, then 

p m = o^). (6.20) 

Otherwise, 

Pm = o (—) . (6.21) 

Km J 



Proof - Note first that: 

p m < maxp(e, A) = p[e max (A),A], (6.22) 
where O on the r.h.s. is a continuous variable, and <d max (A) is the solution (in 0) of: 

d P (e,A) 



= 0. (6.23) 



The proof for the special case where (|6.19|) has no root in [1, f s ] follows immediately by Lebesgue theorem (integration 
by parts). 

The proof for the special case where (|6.19fl has (at least) one root in [1, / s ] is readily obtained using the asymptotic 
stationary phase formula 0] to compute the correlator (|2.1f) . The phase stationary point in l|2.1|l is: 

5 3/8 / c 3 / A x3/8 



^ = ~~8ir~ \GMq ) {e) =/ " (6 ' 24) 
and the (absolute) correlation function can be accordingly written: 

where f a depends on Q (and A) through eq. I|6.24|l . It is readily checked that df a /d<d is always nonzero, so that 
the solution of (|6.23|l are those of: 

^ = 0, (6.26) 

dfa 
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which is nothing but eq. (|6.19() . whose assumedly existing solution will be denoted as /J. Hence 

5/6 Ft-1/2 
Ja 

n(/I) ■ 

The statement follows from H6.27[) together with (|6.22(l • 



maxp(6,A)^4^/ 3 ./f (^>\°'\^ 
e n V 5 V c3 J U (ti) 



(6.27) 



Remark - For all first generation interferometers equation (|6.19() has just one solution f\ <G [1, /,]. Note that fl does 
no£ depend on A. 



Lemma B.2 - The following statement is true: 



Pm = o — 
m 



Proof - The Lemma is based on the (stationary phase) asymptotic evaluation for 9 — ► oo of the integral: 



p(o,o) 



1 rf.J-T/3 



X Ji n(/) 



exp[j27r/e]d/ 







(6.28) 



(6.29) 



We shall now introduce two ancillary conditions, which are suitable weak (asymptotic) versions of the M— dependence 
condition |45|. In particular condition #1 is similar to the mixing condition of dynamical systems |46|. 



Condition 1 - Let {Xj} a sequence of random variables, and let Fj Xt j 2t ,,, t j r (xi, X2, x r ) the joint CDF of 
Xj ± , Xj 2 , Xj r . Condition #1 holds if, for any set of integers {ii < £2, ... < i p } and {j\ < J2, ••■ < jq} such 
that j\ — i p > s, and any real number u, a real-valued function g(s) exists such that: 



with 



F Xil ,...,x ip , Xjl ,...,x jq (u, u, -,u) - F Xil ,...,x ip (u, u, u)F Xji ^, >Xjq (u, u, ...,u) < g(s), 



lim g(s) = 0. 



(6.30) 



3.31) 



Condition 2: Let {u n } a sequence of reals, and let {Xj} a stationary ^1 random sequence. Condition #2 holds if: 



. L«A'J 

lim limsup < n \ ProbLY! > u n ,Xj > u n ] > = 0. 

fc^oo n ^oo I ' I 



3.32) 



Conditions #1 and #2 above imply the following 



Lemma B.3 ( j3(|, theorem 8.5) Let {Xj} a stationary sequence of random variables and let {a n }, {b n } two sequences 
of reals. Assume that conditions #1 and #2 are satisfied with u n — a„ + b n x. Let {Xj} a sequence of i.i.d. random 
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variables having the same CDF as {Xj}, and Q(x) any of the (three) universal extremal limit distributions [13 of 
i.i.d. random variables. Then: 



and 



are equivalent. 



lim Prob 



lim Prob 

n — >oo 



jmax Xj < dfi ~t~ b n x 

j"=l,...,n 



max Xa < a n + b n 2 



g{x) 



g{x) 



(6.33) 



(6.34) 



Lemma B.3 plays a key role. Loosely speaking, it states that provided conditions #1 and #2 are satisfied, the 
supremum of a (stationary, dependent) sequence has the same CDF as if the sequence was made of independent r.v. 



We are now in a position to prove three key Lemmas. 



Lemma B.4 - Let Gi = Yi t k with k 6 N fixed, and let Zi = |G;| 2 . The Zi satisfy condition #1. 



Proof - Let {i\ < i%, ... < i p } and {ji < j2, ■■■ < j q } two set of integers such that j\ — i p > s. Let further C p 
and C q and C p+q the correlation matrices of the zero-average complex gaussian random sequences {G^, . . . , }, 
{G jl ,...,G jq }, and {G it , . . . , G ip , G n , . . . , G jq }, respectively. 
Let Ci = C p+q and C 2 =diag[C p , C q ], and G = [G il ,...,Gi p ,Gj 1 ,...,Gj q ], and let 



F ZH ,...,z H> z jl ,...,z u (u,u,...,u) 



det p+q Ci 



-G'cr'G 



and 



F Zil ,...,z ip (u, u, u)F Z]1 z js (u, u, u) 



det p+q C 



d p+q G 



- G * c 2 lG (P+«G 



(6.35) 



(6.36) 



2 J£ 



the pertinent cumulative distributions, u) being a suitable normalization constant, and £ the domain (|Gi| < y/ud 
\Gj\ < 

The Zi satisfy condition #1 iff: 



lim 

s — >oo 



-G'C, G 



d p+q G 



1 



d p+q G 



det p+q CUs" ■ det p+9 C 2 J £ 

Under the assumption 

lim E[GiG* +m }= lim E[Y itk Y* +m k ] = lim p m+1 = 0, Vi, 

m — *oo m — ^oo ' ' rn — >oo 

which holds in view of Lemma B.2, the off-diagonal blocks in Ci go to zero, and hence 

lim (Ci - C 2 ) = =S> lim (det Ci - det C 2 ) = 0. 

s — >oo s — >oo 

To prove (|6.37J1 we note that: 



0. 



(6.37) 



(6.38) 
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-G'C^G 



det p+q Ci 



1 - e -G*(C7 -Ci)G 



,-G*C~ 'G 



det p+9 Ci 



1 - 



/detCi\ p+ ' 
^det C 2 



< 



-G'C, G r 



< 



det p+q Ci 



glG^nc-^-c- 1 !! _ x 



-G*C, G 



+ 



det p+9 Ci 



detCi 
det C 2 



p+<3 



(6.39) 



where || • || is the (matrix) L 2 norm. Using further the simple inequality |G| 2 < Y^h=i l^jJ 2 + Sfc=i l^jJ 2 — (p + q) u i 
we obtain from 1)6.39)1 : 



■G'cr'G 



-G'C, G 



det p+9 d det p+9 C 2 



< 



< 



1 



det p+? Ci v 
in view of 1)6.38)1 . Equation ()6.37)) follows 



detCi 
detC 2 



p+q 



0, 



(6.40) 



The next Lemma B.5 is needed to prove Lemma B.6. 



Lemma B.5 - Let G\ and G 2 two complex standard gaussian random variables, and let: 

c=\E[G 1 G* 2 ]\<\. 

The following inequality holds: 

Prob [|Gi| 2 > u, \G 2 \ 2 >u}< i±^ e -Tfe". 
L J 1 — c 

Proof - From the standard definition of bivariate (complex) normal variables, 

4 



pOO POO x*+x 

Prob[|Gi| 2 >u,|G 2 | 2 >«] =— *— / / x^e'^^Io 



i±5i. / 2c 



1-c 2 

where I (-) is the modified Bessel function. Representing this latter in integral form ,4S|, gives: 

/>oo poo 



Prob |Gi| 2 > u,|G 2 | 2 >u\< 



Switching to polar coordinates (p, </>), 

x\ + x\ — 2cx\x 2 = p 2 (l — csm2<p) > p 2 (l — c) 

whence: 



x\x 2 e !- c2 dx\dx 2 . 



OO />00 



Prob[|Gi| 2 >u, |G 2 | 2 >u] < — *— I I x lX2 e c) d Xl dx 2 = 

J- c .J . m .1 . m 



(6.41) 



(6.42) 



x\x 2 I dxidx 2 , (6.43) 



(6.44) 



(6.45) 



1~C 2 



oo />oo 



x 2 +x 2 



1-c 2 



X\e 1 + c dx\ 



.J \/u 



1 + c 



-e !+<=. 



(6.46) 



q.e.d. 
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Lemma B.6 - Let {Hj} a sequence of complex standard gaussian random variables, let u n = logn + x, and let: 

7m+1 = \E [HxHl^] | < 1, m ? 0. (6.47) 

If, for some < (3 < oo 



7m O i i . 



1 

TO 



(6.48) 



then: 



Ln/fcj 

lim lim sup < n > Prob 

fc^oo - — I ^— ' 



u, 



= 0. 



3=2 



(6.49) 



Proof - In view of Lemma B.5, 



Ln/fcj 

S n = n Prob 

3=2 



jiJi| 2 > logn + 2E, |-ffj| 2 > logn + ,x 



Ln/fcj 



Ln/fcj 



3=2 



3=2 



(6.50) 



where: 



M = 



1 + 7 2b! 

-e'W, 7 = sup7 i <l. 

1 ~ 7 j>2 



The last inequality follows from jj < 1 Vj > 2, and from the asymptotic decay rate of 7j -. 
From ea. (|6.5()Jl one readily gets: 



(6.51) 



L n / - _i _i 

3=2 

Letting in l|6.52|) to = \n/k\, and using the obvious inequality: 



(6.52) 



k~>j +1 < k~ Vj > 2, 



eq. H6.5()|l becomes: 



3=2 



7,'+l 



The Lemma now readily follows from the fact that: 



(6.53) 



(6.54) 



m 22-1 
lim m^ +1 = 1. 

•n — >oo — ' 



3=2 



(6.55) 



Indeed letting: 



S m = ^m^-1 = ^ 



3=2 



3=2 



2t^ n i 

m 7 3 + , 

m / to 



(6.56) 



and taking limit to — > oo, one gets: 



3=2 



m — log TO 

e 1 ^ — 1 



3=2 



?7J 



(6.57) 
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Obviously, 

W>1, 3Co:e«-l<<SC, VCe[0,Co]- (6.58) 
Now let j' the smallest integer such that: 



2 7j 



1 + 7, 
Then: 

2 7 3 '~i 



logm < Co Vj>j'. (6.59) 



7f-i + 1 

and 

2^7. 



logm>Co, (6.60) 



e-^^.K j^ bgm vi>/. (6.61) 
l + 7j 



Let: 

2^ 



' * m ' * rri 



m 

We can use the inequality l|6.61|l to show that the second partial sum in l|6.62|l vanishes in the limit m — * oo. 
Indeed: 

m — -XT log TO 1 „ - , m „ r . m 

Ee T 3 +i —1 2ologrn^-^ 7, 2ologm , flN ^-^ „ 
< — > 1 — — < — max ( 7 /) y 
m ~ m ^1 + 7,- m j=j',-,m KUJ ' ^ J 
j=j' i=i' j=r 

. 2,5 log m ^ ^ r,- flJ , 2<51ogm „_ W™ 1- "-^ 

m j=j',...,m 

On the other hand in view of Ij6.51|l : 



/■ ^ f" a, 2<51ogm , fm 1 - p -l\ 

{ijf) / i _/3 4? = — . max 7j / — — . 6.63 

Ji m j=]',...,m x \ l—p J 



^eWh 1 **™-! T ^io gm _ 1 (/ _ 2)( / ! 
7 < ~ 2) = u - n '{,, ' m— 



Now we can use the inequality H6.60J) to show that the first partial sum in l|6.62|l also vanishes in the limit 



m — > 00: 



/ 2 ~ti 

E 5^_i,(/- 2 ) hy _,,-(^-l) (•»«-!). (6.64) 



J'=2 

This completes the proof 



Remark - Lemma B.6 holds for general (i.e., stationary as well as non-stationary) sequences. This most general case 
will be invoked in Lemma B.8. 



From Lemmas B.3 to B.6, one readily proves that the CDF of the supremum of the whole-bank subsets corresponding 
to a fixed chirp mass belongs asymptotically to the Gumbcl universal class, as the pertinent number of correlators 
Nq goes to infinity [49|. 
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Theorem 1 - Let Zj = \Yj. k \ 2 , k € N being fixed. If, for some < f3 < oo 



Pm = O 



then the supremum of {Zj\j = 1, . . . , Nq} is asymptotically Gumbel-distributed for Nq — > oo, viz. 



lim Prob 

Nq — too 



max Z ; < log Nq + x 

j=l,...,N e 



exp[— exp(— x)], Vfc. 



(6.65) 



(6.66) 



Proof - The Zj are exponentially distributed. If they were independent, eq. (|6.66|l would be trivial, as noted in Sect. 
rvl However, VA; G N, the sequence Zj satisfies both conditions #1 and #2, with u n — logn + x, in view of Lemmas 
B.4 and B.6. Thus by Lemma B.3, eq. I|6.66|l holds true despite the Zj being dependent • 



We turn now to prove Theorem 2. To this end we start with two relevant Lemmas. 



Lemma B.7 - Let: 



The sequence {Z k } satisfies condition #1. 



Z fe = max \Yj k \ 2 . 

j=l,...,Ne J ' 



Proof - From the obvious relation 

Prob[Z fc < u] = Prob [|Yi, fe | 2 < u, \Y 2 . k \ 2 < u, \Y Ne . k \ 2 < u] 



(6.67) 



(6.68) 



the probability Prob [Z k < u] is a joint probability of gaussian complex variables. The proof follows from Lemma B.4 



Lemma B.8 - If, for some < (3 < oo 



Pm = O 



1 



then the sequence {Zk} satisfies condition #2, with u n — logn + x. 



(6.69) 



Proof - Let 



and let 



E, 



Clearly Eh = \ \Ej t h, and hence: 



Prob[^i n E k ] = Prob 



IK* 

\i=l / 



. True, Z h > u n 
Eh = { False, Z h <u n > ' " ' ' ' 



True, \Yj, h \ 2 > u n 
\ False, | | 2 <u n 



j = l,...,N e h = l,...,N A . 



i U E ^ 



Prob 



JV e 

|J (£yn%) 



(6.70) 



(6.71) 



< Prob [^.i n E iA 



(6.72) 



Now condition #2 for Zk can be written: 



On the other hand, 



But in view of Lemma B.6, 



and the proof is complete • 



From Lemmas B7 and B8, we can finally prove 



Theorem 2 - If, for some < (3 < oo 



then the supremum of {Zk\k = 1, . . . , N&} is asymptotically Gumbel distributed for Na — * oo, viz 



lim Prob 



max Zk = max max \Yj k\ 2 <\o&(NqN&) + x 

fc=l,...,7V A fe=l,...,iVA3=l,...,JVe 



Proof - The sequence {Zk} satisfies both conditions #1 and #2, and thus by Lemma B.3 



lim Prob 

Na— >oo 



max Zk < \og(NeNA) + x 
k=i,...,N A 



lim [F z „ (\og(N e N A ) + x)] N& . (6.78) 
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[n/mj 

lim lim sup < n 7 Prob [Zi > w n , > u n ] > = 0. (6.73) 



[n/mj ! iV G [ |n/™J 

lim lim sup < n > Prob [Z\ > u ni > u n ] > < > lim lim sup < n 7 Prob [Ei 1 n i£ 7 -.fc] > . (6.74) 



[n/mj 

lim lim sup ^ n V Prob [EnnE jk ] \ = 0, Vi,j. (6.75) 



Pm = o I — - ) , (6.76) 



= exp[- exp(-x)] . (6.77) 



The explicit form of the r.h.s. of (|6.78|) . which corresponds to the case where the Zk are assumed as independent is 
as yet unknown, such being the CDF Fz k - However, by the inclusion-exclusion principle |50j : 

1 - F Zk (x) = AfeProb [Zj > x] - Prob [Z t > x, Zj > x] + ^ Prob [Z. t > x, Zj > x, Z h > x] - ... (6.79) 

i>j i>j>h 

In the limit as x — > oo, the r.h.s. of (|6.79() is infinitesimal. The first term in (|6.79() is the principal part. Indeed, for 
any (non empty) set of integers %2, i p } different from i and j, and for all x 

Prob [Zi > x, Zj > x] > Prob [Z, > x, Zj > x, Z n > x, Z lp > x] . (6.80) 

Furthermore, from Lemma B.5, 

Prob [Zi > x, Zj > x] ~ exp(— vox), w > 1, (6.81) 

while 

Prob [Zj >x}= exp(-x). (6.82) 



Hence, 
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lim , \lr k{X \ m = to ff* - 1, (6-83) 

z^oo l — exp|— We exp(— xjj z^oo iVe exp(— x) 



which means that the two distributions Fz k (x) and exp[— iVe exp(— x)\ are right tail equivalent. Under this assumption, 
theorem 3.15 in [8(1 | states that the domain of attraction of the two distributions is the same, viz: 

lira {F Zk {\og(N N A ) + x}} NA = lim {exp [-N& exp (- \og(N e N A ) - x)]} Na = exp[- exp(-z)]. (6.84) 
Letting (|6.84l) into (|6.78|) proves the theorem • 



Remark - Theorem 2 shows that (uniform) convergence is achieved in the limit N A — > oo at any fixed Nq. We add 
without proof that the larger iVe, the faster the convergency rate. 
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Captions to the Figures 



Fig. 1 - Newtonian template bank with lexicographic ordering. 

Fig. 2 - Difference between the Gumbel distribution G N (x) and the cumulative distribution of the whole-bank 
supremum ^^\x) corresponding to the assumption of statistically independent correlators vs. x — \ogN for various 
values of N. The supremum of the difference is plotted in the inset vs. log 2 N. 

Fig. 3 - Difference between the Gumbel distribution G N °(x) and the cumulative distribution of a single reduced 
correlator vs. x — logiVe, for several values of iVe, and two different values of pm- 



Fig. 4 - Comparison among several approximants for the cumulative distribution of a single reduced correlator vs. 
x — log iVe- Numerical simulation (10 4 realizations). 



Fig. 5 - Comparison among several approximants for the cumulative distribution of the whole-bank supremum vs. 
x — logiVe. Numerical simulation (10 4 realizations). 



Fig. 6 - Detection threshold 7 corresponding to a = 1 CP 3 obtained from different models of the no-signal cumulative 
distribution of the whole-bank supremum vs. number of reduced templates Na- The dotted-dashed line is the 
continuous limit, eq. (|4.35() . Search range O.27W < M < WMq, Newtonian waveforms, Ligo-I noise. 



Fig. 7 - Minimal match T vs. number of reduced templates Na- Search range 0.2.M© < M. < 10.M©, Newtonian 
waveforms, Ligo-I noise. 



Fig. 8 - Detectable fractions r\ (eq. (|3.8|) . solid line) and r/o (eq. (|3.9fl . dashed line) of potentially observable sources 
vs. number of reduced templates Na, for several false-alarm levels. Search range O.2A^0 < M. < IOMq, Newtonian 
waveforms, Ligo-I noise. 



Fig. 9 - Detectable fractions r\ of potentially observable sources, obtained from different models of the no-signal 
cumulative distribution of the whole-bank supremum vs. number of reduced templates Na, for several false-alarm 
levels. Search range 0.2.M© < A4 < I0.M©, Newtonian waveforms, Ligo-I noise. 
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